Extracting Definitions of Mathematical Expressions in Scientific Papers
نویسندگان
چکیده
Natural language definitions of mathematical expressions are essential for understanding the mathematical content of scientific papers. A textual description corresponding to a mathematical expression determines the type of symbol or function and the specific name for reference. Our objective is to create an automatic way of extracting definitions of mathematical expressions. We needed to create an annotated corpus since there was no annotated data available on relations between mathematical expressions and their definitions and such annotated data would enable us to compare different approaches to the relation extraction task. This paper introduces guidelines for annotating definitions of mathematical expressions. By using 14 manually annotated papers from Springer, we investigated pattern matching and machine learning based methods in comparison with naive practice based on the nearest noun of the preceding text. The result shows potential of our approach in detecting definitions and the usefulness of our annotated data.
منابع مشابه
Contextual Analysis of Mathematical Expressions for Advanced Mathematical Search
We found a way to use mathematical search to provide better navigation for reading papers on computers. Since the superficial information of mathematical expressions is ambiguous, considering not only mathematical expressions but also the texts around them is necessary. We present how to extract a natural language description, such as variable names or function definitions that refer to mathema...
متن کاملرعایت اصول صحیح مقاله نویسی در مقالات چاپ شده توسط اعضای هیات علمی دانشگاه علوم پزشکی شیراز طی سالهای 1381 تا 1386
Background and Objectives: Publication of scientific articles nowadays is one of the important indexes of knowledge production. This index plays a key role for ranking in academia. The aim of this study was to assess the how academic staff in Shiraz Medical Sciences University considered the principle of scientific writing. Methods: In a cross-sectional study 200 published papers among 1104 ...
متن کاملLogical Structure Analysis of Scientific Publications in
Even though the Linking Open Data cloud is constantly growing, there is a serious lack of published data sets related to the domain of academic mathematics. At the same time, since most scholarly publications in mathematics are well-structured and conventional, it’s promising to get their helpful detailed representation. The paper describes an approach to extracting and analyzing the structure ...
متن کاملA Tagged Corpus for Automatic Labeling of Disabilities in Medical Scientific Papers
This paper presents the creation of a corpus of labeled disabilities in scientific papers. The identification of medical concepts in documents and, especially, the identification of disabilities, is a complex task mainly due to the variety of expressions that can make reference to the same problem. Currently there is not a set of documents manually annotated with disabilities with which to eval...
متن کاملMedicalization Defined in Empirical Contexts – A Scoping Review
Background Medicalization has been a topic of discussion and research for over four decades. It is a known concept to researchers from a broad range of disciplines. Medicalization appears to be a concept that speaks to all, suggesting a shared understanding of what it constitutes. However, conceptually, the definition of medicalizat...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2012